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Relevant: Document Display Apparatus and Method 



Background of the Invention 
Field of the Invention 

The present invention relates to a technology of 
supporting the access by users to necessary 
information by properly arranging a group of cross- 
referenced documents and displaying the group of 
documents from various points of views. 

Description of the Related Art 

In electronic conferences, electronic news, etc. 
established through personal computer communications 
or computer networks, there has been a request to 
easily and quickly locate a necessary document in a 
set of sequentially accumulated documents. 

In response to the request, a conventional 
technology has been developed to present the users 
with a list of titles obtained by rearranging the 
titles of documents in a set of documents in the order 
of document generation date. 

Another conventional technology displays cross- 
references of documents by classifying a set of 
documents into groups of cross-referenced documents 
and displaying an indented title of each document in 
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a group of cross-referenced documents, A further 
conventional technology displays cross-references of 
documents by displaying the numbers of the documents 
in a set of documents in a tree format. 
5 A further conventional technology searches entire 

documents in a set of documents for a specific key 
J?i word, and lists search results. 

^ However, since the information displayed by the 

® above described conventional technologies is limited, 

f|i 10 the following problems cannot be solved by the above 

r=T: 

J described conventional technologies. 

fU 1. To retrieve a required document or a group of 

documents probably containing a required document from 

^ ! 15 a set of various documents, the only information to 

be referred to is a list of titles of documents. 
Since all titles do not correctly describe the 
contents of the documents, it is hard to obtain a 
desired retrieval result. 

20 

2. Indented titles and document numbers in a tree 
structure are not enough to correctly inform a user 
of the entire structure of a group of documents and 
of the transition of the subjects in the group of 
2 5 documents . 
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3 . Necessary documents cannot: be accessed from 
various points of view. 

4. When there are a number of retrieval results, 
5 required documents cannot be accessed unless limited 

retrieval is performed or each of the retrieval result 
lists is checked. 

On the other hand, a technology of extracting a 
10 keyword from a plurality of specific documents and 
automatically setting a link from each document 
containing a common keyword to another document is 
well-known. This conventional technology enables a 
plurality of relevant documents to be efficiently read 
15 by setting the documents as cross-referenced in 
specific documents such as published patents, 
technological theses, etc. 

However, the above described conventional 
technology aims at easily referencing relevant 
20 documents, and cannot be used to support the retrieval 
of a necessary document or a group of documents which 
probably contains a necessary document. 
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Summary of the Invention 

The present invention has been developed based 
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on the above described background to support the 
retrieval of a necessary document or a group of 
documents which probably contains a necessary document 
from a set of documents containing a large volume of 
5 documents. 

The present invention is based on a relevant 
document display device for displaying a group of 
documents containing documents which can be cross- 
referenced. 

10 The first aspect of the present invention is 

designed to have the following configuration. 

First, a document group analysis device 
(structure analysis unit 402) classifies a plurality 
of documents forming a set of documents into one or 

15 more groups of documents (thread) which can be cross- 
referenced by analyzing the reference among the 
plurality of documents forming the set of documents, 
and extracts information relating to the 
classification as document group information (thread 

20 index 305 ) . 

A document group keyword extraction device 
(contents analysis unit 403) extracts a keyword 
contained in a document forming part of each group of 
documents by referring to document group information. 

25 A document group keyword display device (display 




5 

device 306) displays each document with a relevant 
title of the document associated with a keyword 
extracted by the document group keyword extraction 
device for each group of documents. 
5 With the above described configuration according 

to the first aspect of the present invention, a group 

-En. 

of documents probably containing a necessary document 
can be easily retrieved from a set of various 
documents using a keyword. 

i I 

rtj 10 According to the second aspect of the present 

J invention, the document group keyword display device 

J=; having the configuration according to the first aspect 

W of the present invention is designed to display with 

£■ enhancement a group of documents containing a document 

" 15 specified by a user for retrieval, or a keyword 

specified for retrieval. 

With the configuration according to the second 
aspect of the present invention, a group of documents 
specified for retrieval can be recognized at a glance. 
20 The third aspect of the present invention has the 

following configuration. 

The document group analysis device ( structure 
analysis unit 402) classifies a plurality of documents 
forming a set of documents into one or more group of 
25 documents which can be cross-referenced by analyzing 
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■the reference among the plurality of documents forming 
the set of documents, and then extracts information 
about the classification as document group 
information . 

5 A document attribute analysis device ( format 

analysis unit 401 ) extracts document attribute 
information (meta-index 304) indicating the attribute 
of a document from each of a plurality of documents 
forming a set of documents. 

10 A document group structure display device 

(display device 306) displays the reference among each 
group of documents by referring document group 
information and document attribute information in a 
tree structure in which each document attribute 

15 information or abbreviated information for each 
document forming part of the group of documents is 
displayed as each node. To be more practical, the 
document group structure display device displays the 
reference of each group of documents among a plurality 

20 of document groups in a tree structure in which an 
abbreviated name of an author of each document forming 
part of the group of documents is combined with 
document volume information (number of lines) to be 
displayed as each node. 

25 With the configuration according to the above 
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described third aspect of the present invention, the 
structure of the entire group of documents can be 
easily recognized . 

According to the fourth embodiment of the present 
5 invention, the document group structure display device 
according to the above described third aspect of the 
present invention further displays a plurality of 
topics (a main topic and a sub-topic) extracted from 
a document contained in each group of document group 

10 by associating them with each node forming part of a 
tree structure displayed corresponding to the group 
of document . To be more practical, the document group 
structure display device displays for each group of 
document each topic and each relevant node in the same 

15 color. 

With the above described configuration according 
to the fourth aspect of the present invention, the 
transition of the subject of the entire group of 
documents can be easily recognized. 

20 According to the fifth embodiment of the present 

invention, the document group structure display device 
according to the above described third aspect of the 
present invention is designed to display with 
enhancement a node corresponding to the document 

25 specified for retrieval by a user. 
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With the above described configuration according 
to the fifth aspect of the present invention, the 
retrieval can be performed with the structure of the 
thread recognized. 

The sixth aspect of the present invention has the 
following configuration. 

The document group analysis device (structure 
analysis unit 402) classifies a plurality of documents 
forming a set of documents into one or more groups of 
documents which can be cross-referenced by analyzing 
the reference among the plurality of documents forming 
the set of documents, and extracts the information 
about the classification as document group 
information . 

A topic analysis device (structure analysis unit 
402) further classifies each of the classified 
document groups based on the topic extracted from a 
document forming part of each group of documents, and 
extracts the information relating to the further 
extraction as topic classification information (sub- 
topic index ) . 

A topic keyword extraction device (contents 
analysis unit 403) extracts a keyword contained in a 
document related to a topic for each group of 
documents and each of the further classified topics 
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of each of the group of documents. 

A topic keyword display device (display device 

306) displays the title, of a document relating to 

each group of documents and each of the further 
5 detailed topics, associated with the keyword extracted 

for each topic by the topic keyword extraction device. 
Jri With the above described configuration according to 

"p: the sixth aspect of the present invention, a precise 

JJ 1 limitation of topics can be performed using a keyword. 

fy 10 The present invention can be realized as a 

Z computer readable storage medium to direct a computer 

J 85 ; to perform the functions similar to those performed 

fti with each of the above described configurations 

yh according to the present invention. 

^ : 15 As described above, a group of documents can be 

accessed from various points of views according to the 
present invention . 



Brief Description of the Drawings 

20 Other objects and features of the present 

invention are easily understood by one of ordinary 
skill in the art by referring to the attached drawings 
and the descriptions of the preferred embodiments of 
the present invention. 

25 FIG. 1 shows an example of group of documents 
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which can be cross-referenced; 

FIG. 2 shows an example of the data structure of 
a document; 

FIG, 3 shows the configuration (1) of the system 
5 according to a preferred embodiment of the present 
invention; 

% FIG. 4 shows the configuration (2) of the system 

O according to a preferred embodiment of the present 

W invention; 

pi 10 FIG. 5 shows the structure of meta- indexed data; 

yi FIG. 6 shows the structure of thread- indexed 

P data; 

HI FIG. 7 shows the data structure of an indexed 

~p! file; 

ffi 15 FIG. 8 is a flowchart showing the operations of 

the format analysis unit and the structure analysis 
unit; 

FIG. 9 is a flowchart showing the operations when 
an entry of a document number is made in the thread 
20 index of a document number; 

FIG. 10 illustrates the operations when an entry 
of a document number is made in the thread index of 
a document number ; 

FIG. 11 is a flowchart showing the operations of 
25 entering color numbers; 
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FIG* 12 shows an example of a color table; and 

FIG. 13 is a flowchart showing the operation of 
a keyword view; 

FIG, 14 is a flowchart showing the operation of 
5 a thread view; 

FIG. 15 is an example of a control array for a 
thread view; 

FIG. 16 is a flowchart showing the operations of 
an author view; 
10 FIG. 17 shows an example of a control array for 

an author view; 

FIG. 18 is a flowchart showing the operations for 
displaying the contents of a message; 

FIG. 19 is a flowchart showing the operations of 
15 displaying authors and dates in respective colors. 

FIGs. 20A and 20B show examples of display arrays 
for authors and dates in respective colors; 

FIG. 21 is a flowchart showing the operations of 
an enhanced display of a retrieval result using thread 
20 views; 

FIG. 22 is a flowchart showing the operations of 
an enhanced display of a retrieval result using 
keyword views; 

FIG. 23 is a flowchart showing the operations of 
25 controlling the extraction of a keyword from a sub- 
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topic; 

FIG. 24 shows an example of a sub- topic index; 
FIG. 25 shows an example of displaying a keyword 

view; 

5 FIG. 26 shows an example of displaying a thread 

view; 

FIG. 27 shows an example of displaying an author 

view; 

FIG. 28 shows an example of displaying the 
10 contents of a message; 

FIG. 29 shows an example of displaying with 
enhancement (in different colors) a document attribute 
'author 1 using a thread view; 

FIG. 30 shows an example of displaying retrieval 
15 (input) of all messages in a message board; 

FIG. 31 shows an example of displaying with 
enhancement a node containing a character string 
'protocol' using a thread view; 

FIG. 32 shows an example of displaying with 
20 enhancement a thread containing a character string 
'protocol' using a keyword view; 

FIG. 33 shows an example of displaying retrieval 
(output of result) of all messages in a message board; 
FIG. 34 shows an example of displaying a keyword 
25 extracted from a sub-topic; 
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FIG. 35 shows the configuration according to the 
second embodiment of the present invention; 

FIG. 36 shows the configuration of the user 
index ; 

5 FIG. 37 shows the configuration of the document 

index; 

FIG. 38 shows the configuration of the thread 
index; 

FIG. 39 shows the configuration of the QA index; 
10 FIG. 40 shows the configuration of the DISCUSS 

index; 

FIG. 41 shows the configuration of the CHAT 
index ; 

FIG. 42 is a flowchart showing the contents 
15 estimation unit; 

FIG. 43 is a flowchart showing the Q and A 
pattern determining process; 

FIG. 44 shows an example of a pattern contained 
in a Q (question) document; 
20 FIG. 45 shows an example of a pattern contained 

in a thanks document; 

FIG. 46 shows an example of a thread structure 
estimated by the Q and A pattern determining process; 
FIG. 47 shows an example of a thread document 
25 group estimated by the Q and A pattern determining 
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process; 

FIG. 48 is a flowchart showing the discussion 
pattern determining process; 

FIG. 49 shows an example of the thread structure 
5 estimated by the Discussion pattern determining 
process; 

FIG. 50 is a flowchart showing the operations 
performed by the Chat pattern determining process; 

FIG. 51 shows an example of the thread structure 
10 estimated by the Chat pattern determining process; 

FIG. 52 shows a display example of an original 
thread structure; 

FIG. 53 shows a display example of an original 
thread structure; 
15 FIG. 54 shows a display example of an original 

thread structure curtailed based on the number of 
reference nodes; 

FIG. 55 shows a display example of a thread 
structure obtained by curtailing documents having the 
20 same title; 

FIG. 56 shows a display example of a thread 
structure retrieved in time period units; 

FIG. 57 shows the graph structure containing 
authors as nodes; 
25 FIG. 58 shows a display example of the network 
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news retrieval system ; 

FIG. 59 shows a list; of retrieval screens; 

FIG* 60 shows the output result 1: a list of 
threads; 

5 FIG. 61 shows the output result 2: display of 

thread structure 

FIG. 62 shows the output result 3: display of 
time period thread ; 

FIG. 63 is a flowchart of generating a display 
10 of time period thread; 

FIG. 64 shows the configuration of the data of 
a calendar index; 

FIG. 65 is a flowchart of generating an author 
node graph; 

15 FIG. 66 shows the configuration of the data of 

author array ; 

FIG. 67 shows the generation of an author node 
graph; and 

FIG. 68 shows a storage medium storing a program 
20 for realizing a preferred embodiment of the present 
invention. 
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Description of the preferred embodiments 

The preferred embodiment of the present invention 
are described in detail below by referring to the 
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attached drawings . 

General configuration 

FIG. 1 shows an example of a set of documents and 
5 a group of documents to be processed in the preferred 
embodiments of the present invention. 

In this example, there are virtual public 
conference halls, through a computer network, referred 
to as forums in which users discuss for respective 

10 subjects. Each forum is divided into a plurality of 
virtual places referred to as message boards in which 
further detailed subjects are discussed about. 
Discussions proceed with documents of messages 
uploaded. Forums and message boards are defined as 

15 storage areas in a server computer, and store 
documents according to the above described division 
into virtual places. In each message board, a group 
of documents having a plurality of documents which can 
be cross-referenced forms a thread. 

20 A document provided by a user has, for example, 

a data structure shown in FIG. 2, and includes a 
header portion containing document attribute fields 
such as a document number indicating the number of the 
document, a date, a title, and a referenced document 

25 number to which the document refers, name of an 
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author, etc. and a contents portion describing the 
body of the document. 

The following display formats are provided 
according to the preferred embodiment of the present 
invention. 

1. Keyword view ; In a set of documents forming a 
message board, a keyword is extracted for each thread 
from a group of documents forming the thread. The 
extracted keywords are displayed in the format shown 
in FIG. 25 together with the number of the documents 
and the titles of the threads containing the 
documents. 

Through the keyword view, a user can easily find 
a thread (group of documents) probably containing a 
necessary document in a set of various documents based 
on a keyword. 

2. Thread view ; A group of documents forming each 
thread are displayed in the display J format shown in 
FIG. 26 in such a way that the references, titles, 
authors, and number of lines of document can be 
simultaneously recognized . 

Through the thread view, the structure of the 
entire thread can be recognized, and the transition 
of a subject can be easily understood. 
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3. Author view : The titles of documents are 
classified based on authors, and sorted in order from 
largest to smallest number of messages. For the same 
author, the titles are sorted and displayed in the 
5 order of date as shown in FIG. 27. 

From the viewpoint of an author and a date of a 
y message, a document in a set of documents (message 

O board) can be referred to. 

5- 4. Reflection of retrieval result on each view: A 

^; 10 document relating to a user-specified retrieval 

keyword is displayed with enhancement in a keyword 
p view, a thread view, etc. in the display format shown 

jfj in FIG. 32 or 33. 

^ As a result, documents can be more correctly 

15 recognized. 

5. Switch of each view : A necessary document can be 
accessed from various points of views by optionally 
switching the above described keyword view, thread 
view, and an author view. 

20 Described below in detail are the preferred 

embodiments capable of displaying in the above 
described methods according to the present invention. 

FIGs. 3 and 4 show configurations of the systems 
according to the preferred embodiments of the present 

25 invention. 
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The group of documents in the forum/message board 
are accumulated as a document group database 301 in 
a predetermined server computer, 

A document group analysis device 302 analyzes a 
5 group of documents contained in each set of document 
corresponding to each message board in the document 
group database 301 . 

A summation device 303 generates a meta-index 
304, a thread index 305, and an indexed file 404 based 
10 on the analysis result from the document group 
analysis device 302. 

A display device 306 displays a group of 
documents in any of the keyword view, thread view, and 
author view formats through the meta-index 304 and the 
15 thread index 305. 

A character string retrieval device 405 retrieves 
each document forming part of the set of documents in 
the document group database 301 using the indexed file 
404 according to a user-specified word to be 
20 retrieved, display device 306 reflects the retrieval 
result on the keyword view or the thread view and 
displays the result. 

The display device 306 displays a group of 
documents in any format of the keyword view, thread 
25 view, and author view using the meta-index 304 and the 
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thread index 305. 

The document group analysis device 302 comprises 

a format analysis unit 401, a structure analysis unit 

402, and a contents analysis unit 403. 
5 The format analysis unit 401 extracts a document 

number, title, name of author, date, and reference 
^ document number from a header portion of each document 

C having the data structure shown in FIG . 2 forming part 

gj of the set of documents in the document group database 

Si 10 301, computes the number of lines of the contents 

^ ; portion of each document, and enters them in the meta- 

C index 304 having the data structure shown in FIG. 5 

r|; through the summation device 303. 

^/ The structure analysis unit 402 divides each 

W 15 document into groups of documents in thread units 

based on a document number and a reference document 
number extracted by the format analysis unit 401 from 
each document, and generates the thread index 305 
which is a reference list of documents forming each 
20 thread through the summation device 303. 

FIG. 6 shows the data structure of the thread 
index 305. 

A list showing for each thread a root document 
number, the number of documents, and the structure of 
25 a thread is entered. The list is represented as 
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follows. 

(parent document number child document 
number/sub-tree child document number/sub-tree ) 

5 

The portion 'child document number/sub-tree' can 
further include a child list recursively. 

Each reference represented in each of the 
listings of the two threads shown in FIG. 6 is 

10 indicated to the right of the list. 

The structure analysis unit 402 further divides 
groups of documents forming an analyzed thread into 
groups of sub-documents having the same titles, 
assigns a color number to each of the groups of sub- 

15 documents, and enters color numbers assigned to the 
sub-document group of a document in an entry of the 
meta-index 304 having the data structure shown in FIG. 
5 corresponding to the document in each sub-document. 
The contents analysis unit 403 combines a group 

20 of documents forming each thread divided by the 
structure analysis unit 402 in a single document file, 
and extracts a keyword from the combined documents. 
A technology of extracting a keyword from a Japanese 
document can be found in various well-known 

25 technologies. To extract a keyword from a Japanese 
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document with higher precision, a character which 
becomes a noise is removed through a pattern matching. 
Additionally, for example, only the predetermined 
number of higher order keywords are extracted. 
5 The keyword of each thread extracted by the 

contents analysis unit 403 is entered in an entry of 
the met a- index 304 having the data structure shown in 
FIG. 5 corresponding to the root document of the 
thread through the summation device 303. 
10 The contents analysis unit 403 extracts an 

indexed word contained in a keyword extracted for each 
thread, and generates the indexed file 404 having the 
data structure shown in FIG. 7 through the summation 
device 303. 

15 The indexed file 404 is referred to by the 

character string retrieval device 405 as described 
above . 

Detailed explanation of the document group analysis 

20 device 302 

FIG. 8 is a flowchart showing the control 
operations realized by the format analysis unit 401 
and the structure analysis unit 402 in the document 
group analysis device 302 shown in FIG. 4. 

25 The format analysis unit 401 reads document data 
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in line units from a new document file newly entered 
from the document group database 301, and extracts the 
document number, title, name of an author, date, and 
reference number from the header portion (refer to 
5 FIG. 2) of the document file (loop in steps 801, 802, 
803, and back to 801), 
hi ! When the format analysis unit 401 completes 

Ci extracting the data from the header portion, it 

ff\ generates a new entry in the meta-index 304 having the 

~j 10 data structure shown in FIG. 5 through the summation 

ff 1 device 303, and enters in the entry the extracted 

O document number, title, name of an author, date, and 

reference document number (steps 802 and 804). 
U J Then, the format analysis unit 401 reads document 

W 15 data in line units from the contents portion (refer 

to FIG. 2) after the header portion in the above 
described new document file, and counts the number of 
lines in the contents portion until the end of the 
file (EOF) is detected (loop in steps 805, 806, 807, 
20 and back to 805 ) . 

When the end of the file is detected, the format 
analysis unit 401 enters the number of lines counted 
up to the current point through the summation device 
303 in the contents portion in the entry, 
25 corresponding to the document number of the new 
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document; being processed, in the meta-index 304 having 
the data structure shown in FIG, 5 (steps 806 and 
808), 

In the above described line number counting 
process, the line recited from another document (for 
example, a line starting with 1 > ' ) can be designed not 
to be counted in the line number counting process so 
that the number of lines actually related to the 
message can be counted in the current document. 

Then, control is passed to the structure analysis 
unit 402, and the structure analysis unit 402 first 
enters the document number of the new document being 
processed in the thread index 305 having the data 
structure shown in FIG, 6 through the summation device 
303 (step 809). 

FIG. 9 is a flowchart showing the operations 
entered in step 809. 

First, the structure analysis unit 402 determines 
whether or not the new document being processed is a 
root document of a certain thread (step 901). 
Practically, when no reference document number can be 
detected in a new document being processed in the loop 
in steps 801 to 803 shown in FIG. 8, the structure 
analysis unit 402 determines that the document is a 
root document . 
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When "the structure analysis unit 402 determines 
that a new document being processed is a root document 
of a certain thread, it generates a new entry in the 
thread index 305 through the summation device 303, and 
5 enters the document number of the new document being 
processed in the entry as a root document number 
(steps 901 and 902). 

The structure analysis unit 402 initializes the 
number of documents in the above described entry to 

10 1 after the process in step 902 (step 906), and 
terminates the process in step 809 shown in FIG, 8. 

When the structure analysis unit 402 determines 
that a new document being processed is not a root 
document of a certain thread, it determines whether 

15 or not a list exists in an entry in the thread index 
305 containing the reference document number, which 
is the parent document number of the list, extracted 
from the header portion of the new document being 
processed ( step 901 and 903 ) . 

20 If the structure analysis unit 402 determines 

that a list having as a parent document number a 
reference document number extracted from the header 
portion of a new document being processed exists in 
the above described entry, then the structure analysis 

25 unit 402 enters the document number extracted from the 
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header porion of -the new document being processed as 
a child document number of the list (steps 903 and 

905) . 

If the structure analysis unit 402 determines 
5 that a list having as a parent document number a 
reference document number extracted from the header 
^ portion of a new document being processed does not 

O exist in the above described entry, then the structure 

m analysis unit 402 generates in the entry a list having 

irj 10 the reference document number as a parent document 

^ ! number, and enters the document number extracted from 

Ci the header portion of the new document being processed 

i?j as a child document number of the list (steps 903 

^; through 905) . 

W 15 After the process in step 905, the structure 

analysis unit 402 updates ( increments by 1 ) the number 
of the documents in the above described entry ( step 

906) , and terminates the process in step 809 shown in 
FIG. 8. 

20 A practical example of the control operation 

realized according to the flowchart of the operations 
shown in FIG, 9 is explained by referring to FIG. 10. 
FIG. 10 shows the process of generating the list of 
entries of the thread whose root document number is 

25 001 in the thread index 305 shown in FIG. 6. 
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When a root document having the document number 
001 is processed, a new entry is generated in the 
thread index 305 by performing the processes in steps 
901 and 902 in FIG. 9, the document number 001 is 
5 entered as a root document number in the entry ( ( 1 ) 
in FIG. 10), and the number of documents of the above 
described „ entry is initialized to 1. 

Next, after a list having as a parent document 
number the reference document number 001 extracted 

10 from the document having the document number 002 is 
generated in the entry of the root document number 001 
in the thread index 305 by performing the processes 
in steps 901, 903, 904, 905, and 906 shown in FIG. 9 
when the document having the document number 002 is 

15 processed ((2) in FIG. 10), the document number 002 
is entered as a child document number of the list 
(underlined portion of (3) shown in FIG. 10), and the 
number of documents in the above described entry is 
updated into 2 . 

20 Next, after a list having as a parent document 

number the reference document number 002 extracted 
from the document having the document number 003 is 
generated in the entry of the root document number 001 
in the thread index 305 by performing the processes 

25 in steps 901, 903, 904, 905, and 906 shown in FIG. 9 
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when the document having the document number 003 is 
processed (underlined portion of (4) in FIG. 10), the 
document number 003 is entered as a child document 
number of the list (underlined portion of (5) shown 
5 in FIG. 10), and the number of documents in the above 
described entry is updated into 3. 
^! Next, the document number 004 is entered as a 

«| child document number of the list (underlined portion 

W of (6) shown in FIG. 10) having as a parent document 

«a=. 

Rj 10 number the reference document number 003 extracted 

ys from the document having the document number 004 in 

D the entry of the root document number 001 in the 

m thread index 305 by performing the processes in steps 

901, 903, 904, 905, and 906 shown in FIG. 9 when the 
^ j 15 document having the document number 004 is processed, 

and the number of documents in the above described 
entry is updated into 4 . 

Next, after a list having as a parent document 
number the reference document number 004 extracted 
20 from the document having the document number 005 is 
generated in the entry of the root document number 001 
in the thread index 305 by performing the processes 
in steps 901, 903, 904, 905, and 906 shown in FIG. 9 
when the document having the document number 005 is 
25 processed (underlined portion of (7) in FIG. 10), the 
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document: number 005 is entered as a child document 
number of the list (underlined portion of (8) shown 
in FIG. 10), and the number of documents in the above 
described entry is updated into 5. 
5 Finally, after a list having as a parent document 

number the reference document number 005 extracted 
4; 1 from the document having the document number 006 is 

■Kits:- 

G generated in the entry of the root document number 001 

go in the thread index 305 by performing the processes 

^; 10 in steps 901, 903, 904, 905, and 906 shown in FIG. 9 

ff j when the document having the document number 006 is 

O processed (underlined portion of (9) in FIG. 10), the 

JJi document number 006 is entered as a child document 

^ number of the list (underlined portion of (10) shown 

W 15 in FIG. 10), and the number of documents in the above 

described entry is updated into 6. 

After the above described process in step 809 
shown in FIG. 8, the structure analysis unit 402 
determines the color number for the new document being 
20 processed, and performs the process of entering the 
color number in the entry corresponding to the 
document number of the above described new document 
in the met a- index 304 having the data structure shown 
in FIG. 5 (step 810 shown in FIG. 8). 
25 FIG. 11 is a flowchart showing the entry 
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operation in step 810. In the entry operation, a 
color table having the data structure shown in FIG. 
12 is used. . This table is stored in the storage 
device not shown in the attached drawings. 
5 First, the structure analysis unit 402 determines 

whether or not the new document being processed is a 
C root document of a certain thread (step 1101). 

p Practically, the structure analysis unit 402 

X\ determines that a new document being processed is a 

G 10 root document when no reference document number is 

On detected in the document in the loop in steps 801 

£*: through 803 shown in FIG. 8. 

Jjj When the structure analysis unit 402 determines 

tt\ that the new document being processed is a root 

f3= 15 document of a certain thread, the entry corresponding 

to the document number of the root document is entered 
in a color table having the data structure shown in 
FIG. 12, and the document number and the title 
extracted from the new document being processed (refer 
20 to step 804 shown in FIG. 8) and the initial color 
number are entered in the entry (steps 1101, 1102, and 
1103). In an example shown in FIG. 12, an entry 
corresponding to the color number #1 of the root 
document number 001 is entered, the title becomes a 
25 main topic, and only the root document number 001 is 



initially entered in the document number field of the 
entry. 

Then, the structure analysis unit 402 enters an 
initial color number entered in step 1103 in the entry 
corresponding to the document number of the above 
described new document in the meta- index 304 having 
the data structure shown in FIG. 5 (steps 1103 and 
1110), thereby terminating the process in step 810 
shown in FIG. 8. 

When the structure analysis unit 402 determines 
that a new document being processed is not a root 
document of a certain thread, it further determines 
whether or not the title extracted from the new 
document being processed (refer to step 804 shown in 
FIG. 8) contains a reference code such as 'Re: ' , etc. 
(steps 1101 and 1104). 

When the structure analysis unit 402 determines 
that the title extracted from the new document being 
processed contains a reference code, the reference 
code is deleted from the title (steps 1104 and 1105). 
When the structure analysis unit 402 determines that 
the title extracted from the new document being 
processed does not contain a reference code, the 
process in step 1105 is not performed. 

Then, the structure analysis unit 402 determines 
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whether or not the same title that does not contain 
the reference code and has been extracted from the new 
document being processed has been entered in any of 
the entries corresponding to the thread to which the 
5 new document being processed in the color table having 
the structure shown in FIG. 12 belongs (step 1106). 
Since the thread to which the new document being 
processed belongs and the root document number are 
detected when the entry in the thread index 305 having 

10 the data structure shown in FIG. 6 is determined, the 
entry in the color table is determined from the root 
document number. For example, when the new document 
being processed has the document number 002, an entry 
belonging to the root document number 001 is detected 

15 in the color table shown in FIG. 12. 

When the structure analysis unit 402 determines 
that the same title that does not contain the 
reference code and has been extracted from the new 
document being processed has been entered in any of 

20 the entries corresponding to the thread to which the 
new document being processed in the color table having 
the structure shown in FIG. 12 belongs, the structure 
analysis unit 402 enters the document number of the 
new document being processed in the document number 

25 field of the entry (steps 1106 and 1107). For 
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example, when the new document being processed has the 
document number 002, the document number 002 is 
entered in the document number field of the entry 
belonging to the root document number 001 and having 
an entered color number #1 in the color table shown 
in FIG. 12. 

Then, in the entry corresponding to the document 
number of the above described new document in the 
meta-index 304 having the data structure shown in FIG. 
5, the structure analysis unit 402 enters the color 
number set in the entry in the color table entered in 
step 1107 (steps 1107 and 1110), thereby terminating 
the process in step 810 shown in FIG. 8. 

When the structure analysis unit 402 determines 
that the same title that does not contain the 
reference code and has been extracted from the new 
document being processed has not been entered in any 
of the entries corresponding to the thread to which 
the new document being processed in the color table 
having the structure shown in FIG. 12 belongs, a new 
entry corresponding to the above described thread in 
the color table is generated (step 1108), and a new 
color number in the thread, and a document number and 
title (step 804 shown in FIG. 8) extracted from the 
new document being processed are entered in the 
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generated entry (steps 1106, 1108, and 1109), For 
example, when the new document being processed has the 
document number 003, a new entry belonging to the root 
document number 001 is generated in the color table 
5 shown in FIG, 12, and the color number #2, the title 
of the document having the document number 003, and 
the document number 003 are entered in the generated 
entry. This title becomes a sub-topic 1 to the main 
topic which is the title of the root document number 
10 001. 

Then, in the entry corresponding to the document 
number of the above described new document in the 
meta-index 304 having the data structure shown in FIG. 
5, the structure analysis unit 402 enters a new color 

15 number set in the new entry in the color table in step 
1109 (steps 1109 and 1110), thereby terminating the 
process in step 810 shown in FIG. 8. 

The contents analysis unit 403 combines a group 
of documents forming each thread as a combined 

20 document file as described above by referring to the 
thread index 305 having the data structure shown in 
FIG. 6, and extracts a keyword from the combined 
documents. As a result, the extracted keyword of each 
thread is entered in the entry of the meta-index 304 

25 having the data structure shown in FIG. 5 
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corresponding to "the root document of the thread. 

Detailed explanation about the display device 306 

As described above, the display device 306 can 
5 display a group of documents in the display format of 
the keyword view, the thread view, or the author view 
using the meta- index 304 having the data structure 
shown in FIG. 5 and the thread index 305 having the 
data structure shown in FIG. 6. 

10 In this embodiment, if the system shown in FIG. 

4 is designed to be connected to a Web server for 
controlling the display of a home page, then a user 
logs in a specific message board of a specific forum 
by connecting a Web browser application in the 

15 terminal at hand such as a personal computer, etc. to 
the above described Web server, and then switches and 
displays the keyword view, the thread view, or the 
author view by clicking a predetermined GUI (graphic 
user interface) button using a mouse, etc. 

20 To be more practical, the display device 306 

functions as, for example, a CGI (common gateway 
interface) application for a Web server, generates 
document data in an HTML (hyper text markup language) 
representing each of the keyword view, thread view, 

25 and author view, etc. in response to a user request 



received through the Web server, and transmits the 
generated data to the Web server. When the Web server 
returns the HTML document data to the user through a 
computer network such as the Internet, etc., the above 
described view is displayed on the Web browser 
application on the user terminal unit. 

First the operations of displaying a keyword view 
realized by the display device 306 are described 
below. 

In the keyword view described above, a keyword 
extracted from a group of documents forming each 
thread is displayed in the display format shown in 
FIG. 25 together with the number of documents in the 
group of documents and the title of the thread. 

FIG. 13 is a flowchart showing the operations of 
displaying a keyword view performed by the display 
device 306. 

First, the display device 306 reads a file of the 
meta- index 304 having the data structure shown in FIG. 
5 (step 1301). 

Then, the display device 306 retrieves an entry 
in which a root document is entered while reading data 
entry by entry from a file in the meta-index 304 (loop 
in steps 1301, 1302, and back to 1301). It is 
determined whether or not the root document of each 
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entry has been entered. 

When the display device 306 detects an entry 
whose root document has been entered, it converts the 
root document number into an anchor tag whose URL 
(Uniform Resource Locator), that is, united address 
information for an application for displaying the root 
document in the document group database 301 
corresponding to the root document number, is 
specified as a value of an HREF (Hypertext REFerence) 
attribute (steps 1302 and 1303). 

Then, the display device 306 obtains the number 
of documents (number of child documents) contained in 
a thread by referring to the entry corresponding to 
the above described root document number in the thread 
index 305 having the data structure shown in FIG. 6 
(step 1304). 

The display device 306 extracts a title (main 
topic) and a keyword from an entry whose root document 
has been entered in the meta-index 304 having the data 
structure shown in FIG. 5, and generates HTML table 
document data containing as one table record the above 
described title and keyword, a root document number 
in the anchor tag format converted in step 1303, and 
a data string comprising the number of child documents 
obtained in step 1304 (step 1305). 
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Next, the display device 306 generates the HTML 
table document data for each thread by repeatedly 
performing a series of the above described processes 
in steps 1301 through 1305 until the end of file (EOF) 
is detected in the file of the meta-index 304 (steps 
1306 and 1301). 

When the display device 306 detects the end of 
file (EOF) in the file of the meta-index 304 (yes as 
the result of the determination in step 1306 ) , it 
transmits the finally obtained HTML table document 
data to the Web server, and terminates the operations 
of displaying a keyword view. As a result, a keyword 
view is displayed in a table format shown in FIG. 25 
on the Web browser application in the user terminal 
unit. 

A user can easily find a thread probably 
containing a necessary document in various document 
sets according to a keyword of each thread in a 
keyword view. 

The user can immediately access a root document 
of a desired thread by pointing to an anchor 
corresponding to the root document and clicking a 
mouse button of a mouse device, etc. 

In the above described operation of displaying 
a keyword view, the table records of each thread can 
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be displayed in various colors for a number of child 
documents so that: the user can recognizes the number 
of messages at one glance. 

Described below are the operations of displaying 
a thread view realized by the display device 306. 

In the thread view, as described above, a group 
of documents forming each thread are displayed in the 
display format shown in FIG. 26 in which a user can 
recognize the reference, title, name of the author, 
and the number of lines of a document at a glance. 

In FIG. 26, the reference between threads and the 
transition of topics are displayed by a colored tree. 
A node of each tree corresponds to each document, and 
is displayed in the following format using the leading 
characters (2 bytes) of the name of the author of the 
document and the number of lines of the document. 

name of author [number of lines] 

Symbols such as ■ ' + ', ' = ', etc. are 

added before and after each node. The meanings of 
these symbols are listed as follows. 

A document marked with this symbol is a root 

document . 

1 + f A document referred to by the document 
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marked with this symbol is also referred to by another 
document . 

' = ' There is a document which refers to the 
document marked with this symbol. 

1 . 1 There are no documents which refer to the 
document marked with this symbol. 

In FIG. 26, the 'Main Topic' is followed by the 
title of the root document of the thread, and the 'Sub 
Topic' is followed by the title other than the root 
document in the thread. Each title is uniquely 
colored, and a node corresponding to the document 
having the same title (excluding a reference code) is 
displayed in the same color as the title. 

Thus, the user can understand the structure of 
the entire thread and the transition of topics in a 
thread . 

Furthermore, each node is displayed as an anchor 
so that the user can immediately access the document 
corresponding to the node by pointing to each node and 
clicking a mouse button, etc. 

FIG. 14 is a flowchart showing the operations of 
displaying a thread view performed by the display 
device 306. 

First, the display device 306 reads a list of one 
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entry (one line) corresponding to one thread, and the 
number of documents contained in the thread from the 
file of the thread index 305 having the data structure 
shown in FIG. 6 (step 1401). For example, in the 
thread index 305 having the data structure shown in 
FIG. 6, the following list corresponding to the root 
document number 001 is read, and '6' is read as the 
number of documents. 

(001 (002 003) (004 (005 006))) 

Next, the display device 306 restores, for 
example, the tree structure of the thread indicated 
to the right of the table shown in FIG. 6 (step 1402). 
To represent the tree structure, the display device 
306 generates the array data as shown in FIG. 15. 

Then, the display device 306 extracts an entry 
of the meta-index 304 having the data structure shown 
in FIG. 5 corresponding to a document number forming 
each node of the read list, and extracts from the 
entry the name of the author, the number of lines, the 
color number, and the title (step 1403). The 
extracted data are stored corresponding to respective 
nodes . 

Based on the number of the documents read in step 
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1401 and the color number of each node extracted in 
step 1403, the display device 306 determines the color 
of each title displayed at the head of the thread view 
(step 1404). This operation is realized as an 
operation of mapping actual colors for respective 
color numbers . 

Then, the display device 306 generates an HTML 
document for display of the title of the root document 
contained in a thread and other titles preceded by the 
'Main Topic: ' and the f Sub Topic: ' In this case, each 
title is sequentially read from each entry belonging 
to the above described thread of the color table shown 
in FIG. 12 and managed by the above described 
structure analysis unit 402. Simultaneously, each 
color determined in step 1404 is computed from the 
number of each color sequentially read, and the 
display in each color is sequentially specified. Each 
color is specified by a color specification 
instruction of an HTML document (<font color = > tag, 
etc. ) . 

Finally, the display device 306 generates an HTML 
document for display of a tree structure by 
sequentially processing data from the document number 
of the leftmost node forming part of the array data 
indicating the tree structure of the thread restored 



43 

in step 1402 (step 1406). In this case, as described 
above, the display device 306 converts the document 
number of each node in the tree structure into the 
following display data comprising the leading 
5 character (for example, 2 bytes) of the name of the 
author of a document corresponding to the node and the 
O number of the lines of the document based on the name 

O of the author, the number of lines, and the color 

7f] number extracted in step 1403, and then generates HTML 

~= : 

J! 10 document data for display of the display data in a 

W color corresponding to the color number of the node, 

p The correspondence between a color number and an 

actual color depends on the correspondence determined 
Op in step 1404. Furthermore, as described above, the 

m 15 display device 306 generates HTML document data for 

display of symbols such as ' + ', ' = ', ' . f , etc. 

before or after the above described display data 
corresponding to each node based on the connection of 
the data. To display the tree structure in the format 
20 as is, a Preformatting tag <PRE>, which is a control 
tag in the HTML, is used. Furthermore, the above 
described display data for each node is generated as 
an anchor tag with which the URL to the application 
for display of the document data in the document group 
25 database 301 corresponding to each node is specified 
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as a value of the HREF attribute. 

Then, the display device 306 generates the view 
data for each thread by repeatedly performing a series 
of the above described processes in steps 1401 through 
5 1406 until the end of file (EOF) is detected in the 
file of the thread index 305 (steps 1407 and 1401). 
~; When the display device 306 detects the end of 

O file (EOF) in the file of the thread index 305 (yes 

pn as the result of the determination in step 1407), it 

10 transmits the finally obtained HTML table document 
^ j data to the Web server, and terminates the operations 

G of displaying a thread view. As a result, a thread 

n\ view is displayed in a format shown in FIG. 26 on the 

^; Web browser application in the user terminal unit. 

W 15 Described below is the operation of displaying 

an author view realized by the display device 306. 

As described above, in the author view, the 
titles of documents are classified into author 
(author) units, and the authors are sorted in order 
20 from the highest frequency. Each author is displayed 
with his or her messages arranged in order of date. 
The above described data are displayed in the format 
shown in FIG. 27. 

FIG. 16 is a flowchart showing the operations of 
25 displaying the author view realized by the display 
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device 306. 

The display device 306 uses the author array data 
having the data structure shown in FIG. 17 to realize 
the author view. . Then, the display device 306 
5 initializes the author array data when the display of 
the author view is started (step 1601). 
^ Next, the display device 306 reads data of one 

C- entry from a file of the meta-index 304 having the 

rg; data structure shown in FIG. 5 (step 1602). 

Jr: 10 Then, the display device 306 determines whether 

On or not the author having the name extracted from the 

s 

p entry is an author who is included in the author array 

5f! data (step 1603). 

«[; If the author having the name extracted from the 

W 15 entry is an author who is not included in the author 

array data, then the display device 306 adds a new 
author item to the author array data (steps 1603 and 
1604). If the author having the name extracted from 
the entry is an author who is included in the author 
20 array data, then the display device 306 does not 
perform the process in step 1604. 

Next, the display device 306 enters a document 
number extracted from the above described entry in the 
corresponding author item in the author array data 
25 (step 1605). 
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Then, the display device 306 enters in the author 
array data in author units all document numbers 
entered in the meta-index 304 by repeatedly performing 
a series of processes in steps 1602 through 1605 until 
5 the end of file (EOF) is detected in the file of the 
meta-index 304. 

When the display device 306 detects the end of 
file (EOF) in the file of the meta-index 304 (no as 
the result of the determination in step 1606), the 

10 author items in the author array data are sorted based 
on the number of document numbers entered in 
respective items, that is, the number of message 
documents of each author (step 1607). 

Then, the display device 306 sorts the document 

15 numbers in the order of date extracted from 
corresponding entries in the meta-index 304 in the 
same author item in the author array data. 

Finally, the display device 306 generates HTML 
table document data for display of the name of an 

20 author, the date and the title extracted from the 
entry in the meta-index 304 corresponding to each 
document number in an item for each author item of the 
author array data obtained as a result of the sorting 
operation in steps 1607 and 1608. Then, the display 

25 device 306 transmits the generated data to the Web 



• 
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server and terminates the operation of displaying the 
author view. As a result, the author view is 
displayed on the Web browser application of a user 
terminal in the table format as shown in FIG. 27. 
5 The user can refer to a document in a set of 

documents (message board) from the viewpoint of an 
author and a date of a message in the author view. 

The user can also refer to the message of an 
author in time series, and confirm the leader of 
10 authors at a glance who uploads many messages. 

Other display formats of the display device 306 

Described below are the aspects of displaying 
operations other than the above described operations 

15 of displaying each of the above described views 
realized by the display device 306. 

First, the operation of displaying the contents 
of the message realized by the display device 306 is 
explained below. 

20 As described above, the user can immediately 

access a root document of each thread or a document 
corresponding to each node, etc. by pointing to an 
anchor in each thread in the keyword view or an anchor 
in each node in the thread view and clicking the mouse 

25 button, etc. 
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When these operations are performed by the user, 
the process according to the flowchart shown in FIG. 
18 performed by the display device 306 is activated 
as, for example, a CGI at an instruction from the Web 
5 server. In this case, the process is provided with 
the information having the document number contained 
in the anchor tag specified by the user. 

As a result, after the display device 306 reads 
the information about the above described document 

10 number (step 1801), it reads only the document file 
from the document group database 301 until it reads 
the document file containing the read document number 
described above in the header portion ( loop in steps 
1802, 1803, and back to 1802). 

15 When the display device 306 reads a document file 

containing the read document number described above 
in the header portion (yes as a result of the 
determination in step 1803), it reads data line by 
line from the above described document file in the 

20 loop in steps 1804 through 1809 until it reads the 
header portion of a new document. Then, it converts 
the data into one line of HTML document data, and 
outputs the HTML document data to the Web server ( step 
1808) . 

25 In this case, when the data in each line contains 



49 

the URL for other documents, etc., the display device 
306 outputs the data after converting it into an 
anchor tag specified as a value of an HREF attribute 
using the above described URL (steps 1804 and 1805). 

As a result, the user can jump to other resources 
by pointing to the anchor in the message being 
displayed and clicking the mouse button, etc. 

When the data in each line is a comment line from 
another document, the display device 306 adds to the 
data a tag for a change of color for the data ( steps 
1806 and 1807). 

As a result, the user can recognize a comment 
line at a glance. 

When the display device 306 completes outputting 
corresponding document data, it generates and outputs 
the HTML document in the tree structure of a thread 
containing the above described document, and 
terminates the operation of displaying the contents 
of the message (steps 1809 and 1810). This process 
can be realized as in the above described operation 
of displaying the thread view according to the 
flowchart shown in FIG. 14. 

As a result of the above described displaying 
operations, the contents of a message is displayed on 
the upper half of the display screen, for example, as 
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shown in FIG. 28 on the Web browser application of the 
user terminal unit. The tree structure of the thread 
containing the document including the contents of the 
message is displayed on the lower half of the display 
screen. On the display screen, an anchor for display 
of a keyword view and a thread view, an anchor for use 
in performing retrieval, etc. can be simultaneously 
displayed as shown in FIG. 28. 

The function of switching the views allows the 
documents (messages) in a message board optionally at 
a request from the user, for example, from a keyword 
view to a thread view, the display of the contents of 
a message, an author view, the display of the contents 
of a message, the thread view, . . . , etc. 

Described below is the operations of displaying 
authors and dates in specified colors realized by the 
display device 306. FIG. 19 is a flowchart showing 
the above described operation. 

First, the display device 306 preliminarily 
generates the author array data in which document 
numbers are classified in author items as shown in 
FIG. 20A, and the date array data in which document 
numbers are classified in date items as shown in FIG. 
20B based on the meta-index 304 and the thread index 
305 . The detailed explanation of the generating 
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process is omitted here, but the process can be 
realized as in the process in the flowchart of FIG. 
16. Different colors are assigned to author items in 
the author array data or date items in the date array 
5 data. The assignment of color is determined from a 
total number of authors in each author item or a total 
number of dates in each date item. 

Then, the display device 306 obtains item 
selection button information based on user 
10 specification through a Web server, and determines 
which has been pressed, an author button or a date 
selection button (refer to steps 1902 and 1904 shown 
in FIG. 19 ) . 

If the display device 306 determines that the 
15 author button has been pressed, the display device 306 
generates and outputs the display data for a thread 
tree by referring to the author array data shown in 
FIG. 20A (steps 1902 and 1903). This process is 
realized as in the above described process shown in 
20 the flowchart of FIG. 14. In this case, each node 
of the tree is displayed in a color assigned to the 
author item in the author array data corresponding to 
the name of an author corresponding to the node. 

When the display device 306 determines that the 
25 date selection button has been pressed, it generates 
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and outputs thread tree display data by referring to 
the date array data shown in FIG. 20B (steps 1904 and 
1905). This process is also realized as in the above 
described process according to the flowchart of FIG. 
14. In this case, each node of the tree is displayed 
in the color assigned to the date item in the date 
array data corresponding to the date corresponding to 
the node. 

As a result of the above described displaying 
operations, a tree of a thread uniquely colored for 
each author is displayed, for example, as shown in 
FIG. 29 on the Web browser application of the user 
terminal when the user presses the author button. 
Therefore, the user can recognize at a glance the 
documents written by the same author. 

Described below is the operation of enhancing the 
display of the retrieval result using a thread view 
realized by the display device 306. FIG. 21 is a 
flowchart showing the operation. 

First, the display device 306 generates and 
outputs the HTML document data for display of a post- 
retrieval input form screen (step 2101). As a result, 
a retrieval input form screen as shown in FIG. 30 is 
displayed on the Web browser application of the user 
terminal. The user specifies the execution of 
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retrieval by inputting a word to be retrieved in the 
retrieval input form. 

The word to be retrieved is input in the 
retrieval input form and passed to the character 
5 string retrieval device 405 (FIG, 4) through the Web 
server . The character string retrieval device 405 
searches the entire sentences in each document forming 
part of a thread specified in the document group 
database 301 using the indexed file 404 for the user- 

10 specified word to be retrieved, and outputs the 
document number containing the word to be retrieved 
(steps 2102 and 2103). 

When the display device 306 receives a document 
number containing the above described word to be 

15 retrieved, it displays the HTML document data in the 
tree structure of the thread containing the document 
number in the process according to the flowchart of 
FIG. 14. In this case, the display device 306 
specifies the color of the node containing the above 

20 described document number as an enhanced color ( steps 
2104 and 2105). 

As a result, for example, a thread view as shown 
in FIG. 31 is displayed with enhancement on the Web 
browser application of the user terminal based on the 

25 retrieval result. Thus, the user can recognize the 
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structure of a thread and performs the retrieval. 

Described below is the operation of displaying 
with enhancement a retrieval result using a keyword 
view realized by the display device 306. FIG. 22 is 
5 a flowchart showing the operations. 

First , the display device 306 generates and 
^; outputs the HTML document data for display of the 

2.1 \ 

O post-retrieval input form screen as in step 2101 shown 

fh in FIG. 21 (step 2201). The user specifies performing 

JSSS; 

10 retrieval by inputting a word to be retrieved to a 
yi retrieval input form. 

o The word to be retrieved and input to the above 

1=1 described retrieval input form is passed to the 

^; character string retrieval device 405 (FIG. 4) through 

K 15 the Web server. The character string retrieval device 

405 retrieves all sentences in each document forming 
part of the message board specified in the document 
group database 301 using the indexed file 404 by the 
user specifying a word to be retrieved, and outputs 
20 the document number containing the word to be 
retrieved (steps 2202 and 2203). 

When the display device 306 receives the document 
number containing the above described word to be 
retrieved, it first refers to the thread index 305 
25 having the data structure shown in FIG. 6 and extracts 
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the root; document number corresponding to the entry 
containing the above described document number ( step 
2204). 

Then, the display device 306 displays the HTML 
5 document data for display of the keyword view about 
a specified message board in he process as shown in 
~j the flowchart of FIG. 13. In this case, the display 

O device 306 specifies as an enhanced color the color 

Eg; of the title or the entire display area of the thread 

ST| 10 corresponding to the root document number extracted 

u ' 1 in step 2204, and also specifies the color of a 

O keyword as an enhanced color when the displayed 

?£\ keyword contains a word to be retrieved (steps 2205, 

J 2206, and 2207). 

W 15 As a result, for example, a keyword view as shown 

in FIG. 32 is displayed with enhancement on the Web 
browser application of the user terminal based on the 
retrieval result. Thus, the user can recognize a 
thread containing a word to be retrieved at a glance. 
20 The display device 306 can be designed to display 

the document number as a retrieval result and the 
corresponding title as a listing as shown in FIG. 33. 

Described finally is the operation of displaying 
a keyword view for each sub-topic realized by the 
25 display device 306. 
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In -the above described keyword view, a keyword 
is displayed for each thread. On the other hand, in 
the keyword view for each sub-topic, a keyword can be 
extracted and displayed for each sub-topic in a 
5 thread. 

In this operation, the display device 306 uses 
a sub-topic index having the data structure shown in 
FIG. 24. The sub- topic index has the data structure 
in which a keyword field is added to the data 

10 structure of the color table shown in FIG. 12. 

A sub- topic index is used to actually replace the 
above described color table. Therefore, the contents 
of the field other than a keyword field in the sub- 
topic index are preliminarily entered in the above 

15 described process in step 810 shown in FIG. 8 
performed by the structure analysis unit 402. In this 
case, there can be an entry obtained corresponding to 
each of the main topic indicating the title of a root 
document contained in each root document number 

20 (thread) and the sub-topics indicating the titles of 
other documents in the sub- topic index which is a 
color table as described above about the process in 
step 810 shown in FIG. 8. The display device 306 uses 
the contents of the entries. 

25 FIG. 23 is a flowchart showing the operations of 
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controlling the extraction of a keyword from the sub- 
topic realized by the display device 306. 

First, the display device 306 puts groups of 
documents in each of the combined document files in 
5 main topic units and sub-topic units based on the 
document number entered in each entry contained in 
each thread in the sub- topic index, and inputs a 
resultant combined document file in the contents 
analysis unit 403 (FIG. 4) (step 2302). 

10 The contents analysis unit 403 extracts a keyword 

for each combined document file, and returns a result 
to the display device 306. The display device 306 
enters the keyword for each combined document file 
returned from the contents analysis unit 403 in the 

15 keyword field of the entry corresponding to the above 
described combined document file in the sub-topic 
index (step 2303). 

As described above, a keyword is extracted from 
a group of documents belonging to each thread in main 

20 topic and sub-topic units. 

Afterwards, the display device 306 generates and 
outputs the HTML document data for display of a title 
and a keyword belonging to each title for a thread 
specified by a user in main topic and sub-topic units 

25 for each thread based on the contents of the sub-topic 
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index . 

As a result, a keyword view for each sub-topic 
is displayed in the format as shows in FIG, 34 on the 
Web browser application of the user terminal • Thus, 
the user can precisely limit the topics using a 
keyword . 

Other embodiments of the present invention (Second 
Embodiments ) . 

Described below are other embodiments according 
to the present invention (hereinafter referred to as 
the second embodiments). 

Functions realized by the second embodiment of the 
present invention 

In the second embodiment, the following three 
functions are realized. 

1 . Function of obtaining the entire structure of a 
thread in a small screen 

The entire tree structure can be displayed in a 
predetermined size of screen by curtailing the tree 
of a thread. The second embodiment is described by 
referring to the display on a TTY character terminal. 

Since one node is displayed in one line on the 
character terminal, n nodes can be drawn on a n-line 
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screen. The selection standard of a node to be drawn 
is listed below. 

• number of child nodes referring to the node 

• number of child nodes referring to the node 
5 obtained as a retrieval result 

• node having a title different from the title 
of a root node or a parent node 

G 2. Function of estimating the proceedings of a topic 

ffi in a thread 

J| 10 A specific dialogue such as 'question - answer - 

111 thanks' is detected, and the information is used for 

C display and retrieval, thereby efficiently accessing 

ft-.\ information. 

- ~s* 

J?? I 

~-~[ To be more practical, according to the second 

W 15 embodiment, a topic pattern in a thread can be 

extracted by estimating the attribute information 
(title, author, and reference) of a document and a 
specific sentence pattern characterizing the contents 
of a document. 
20 3. Visualization of user's message pattern 

In network news, the ratio of the number of 
authors to the number of readers is very small. Even 
in a large thread, the number of discussing authors 
is small in many cases. In a specific news group, it 
25 is not rare that useful information is issued by a 
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small number of specific persons. Therefore, 
according to the second embodiment, new views are 
provided from the viewpoint of users who input 
messages by appropriately arranging news and threads. 
5 To be more practical, according to the second 

embodiment, a view is provided from the viewpoint of 

^; users based on the input history of a news group and 

Ci the function of estimating a topic. 

W 

:rj 10 Entire configuration according to the second 

embodiment of the present invention 
□ FIG. 35 shows the configuration of the second 

==j embodiment of the present invention. 

L f/ First, in the preparation phase prior to 

W 15 retrieval, the following operations are performed. 

A document obtaining unit 3502 in a processing 
unit 3501 obtains a group of documents related in 
reference through a network, and stores them in a 
secondary storage device 3503. 
20 A contents estimation unit 3504 generates a 

display index 3505 based on the contents of documents, 
the information pertaining to the documents, and the 
reference among the documents in the group of 
documents stored in the secondary storage device 3503. 
25 A retrieval engine 3506 generates a retrieval 
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index 3507 based on the document contents of the group 
of documents stored in the secondary storage device 
3503. 

For example, in the network news service, 
5 documents are input at any time. Therefore, the above 
described preparation phase is periodically performed, 
for example, once a day to store the latest group of 
documents in the secondary storage device 3503. 

The following operations are performed in the 
10 retrieving process. 

Users issue input instructions through an input 
device 3509. The information to be input includes a 
retrieval keyword, the type of view for display of a 
retrieval result, and the size of a view display area. 
15 A view generation unit 3508 calls the retrieval 

engine 3506 according to the input instruction from 
the input device 3509 to retrieve a corresponding 
group of documents in response to the input 
instruction from the groups of documents stored in the 
20 secondary storage device 3503. 

The view generation unit 3508 generates a result 
view for display of the group of documents retrieved 
by the retrieval engine 3506 using the display index 
3505, and outputs it to a display device 3510. 
25 The above described operations are performed 
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based on the dialogue process between users. That is, 
the users add or change a retrieval keyword on the 
result being displayed, or switch the result view. 

5 Structure of the display index 3505 

In the preparation phase prior to retrieval, the 
following types of indices are generated. 
1 . User index 

This index is used in managing users, and holds 
10 the following information for each entry as shown in 
FIG. 36. 

•User ID: user ID (key) corresponding to the entry 
•Name: user name corresponding to the entry 
•Abbreviation: abbreviation for the user corresponding 

15 to the entry 

•number of messages (number of answers): total number 
of messages of the user corresponding to the entry in 
the message board, and the number of answer documents 
in the Q and A pattern 

20 2 . Document index 

This index is used in managing information for 
each document, and holds the following information for 
each entry as shown in FIG. 37. 

•Document ID: ID (key) of the document corresponding 
25 to the entry 
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•User ID: user ID (key) of the user who generates the 
document corresponding to the entry 

•Title: title of the document corresponding to the 
entry 

•Date: generation date of the document corresponding 
to the entry 

•Number of child and grandchild references: total 
number of documents referring to the document 
corresponding to the entry 

•Path to root: path from the leading message referred 
to by the document corresponding to the entry to the 
document 

•Identification number of title: number indicating the 
order of the title of the document corresponding to 
the entry in the thread (group of documents) 
containing the document 

•Message type: information indicating the type of the 
document, that is, Q (question), A (answer), or T 
( thanks ) , when the document corresponding to the entry 
is contained in the Q and A pattern 
3 . Thread index 

This index is used in managing the information 
for each thread, and holds the following information 
for each entry as shown in FIG, 38. 

•Thread ID: ID (key) of the thread corresponding to 
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the entry 

•Tree structure of thread: reference of a document, 
represented in the document ID list, in the thread 
corresponding to the entry 

•Number of documents: total number of documents in the 
thread corresponding to the entry 

•Number of authors: number of authors of the documents 
in the thread corresponding to the entry 

•UID of largest number of messages: ID of the user who 
issues the largest number of messages in the thread 
corresponding to the entry 

•Contents list: pattern ID list of the Q and A 
pattern, the discussion pattern, and the chat pattern. 
The QA_ID which is the pattern ID of the Q and A 
pattern is entered in an entry in the QA index 
described later. The DS_ID which is the pattern ID 
of the Discussion pattern is entered in an entry in 
the DISCUSS index described below. The CT_ID which 
is the pattern ID in the Chat pattern is entered in 
an entry in the CHAT index. 
4. QA index 

This index is used in managing the information 
of the Q and A, and holds the following information 
for each entry as shown in FIG. 39. 

•QA_ID: ID (key) of the Q and A pattern corresponding 
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•to "the entry 

•Question: field for storing a document ID 
corresponding to the Q (question) document forming 
part of the Q and A pattern corresponding to the entry 
5 -Answer: field for storing a document ID string 

corresponding to the group of A (answer) documents 
forming part of the Q and A pattern corresponding to 
the entry 

•Thanks: field for storing a document ID corresponding 
10 to the T (thanks) document forming part of the Q and 
A 

pattern corresponding to the entry 

•MaxAnswerUID: field for storing the ID (or an ID 
string) of a user (or users) who appears the most 

15 frequently in the authors of each A (answer) document 
forming part of the P and A pattern corresponding to 
the entry, that is, in the authors of the documents 
corresponding to each document ID in the document ID 
string entered in the Answer field in the entry 

20 -Thread ID: ID of a thread containing the Q and A 

pattern corresponding to the entry. This thread ID 
is entered in an entry in the thread index. 
5. DISCUSS index 

This index is used in managing the information 

25 of the Discussion pattern, and holds the following 
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information for each entry as shown in FIG. 40. 
*DS_ID: ID (key) of the Discussion pattern 
corresponding to the entry 

•Message ID list: field for storing a document ID 
5 string of a group of documents forming part of the 
Discussion pattern corresponding to the entry 

*UID: user ID string of a group of documents forming 
part of the Discussion pattern corresponding to the 
entry. 

10 -ThreadlD: ID of a thread containing the Discussion 
pattern corresponding to the entry. This thread ID 
is entered in an entry in the thread index. 
6. CHAT index 

This index is used in managing the information 

15 of the Chat pattern, and holds the following 
information for each entry as shown in FIG. 41. 
*CT_ID: ID (key) of the Chat pattern corresponding to 
the entry 

•Chat list: field for storing the document ID string 
20 of the group of documents forming part of the Chat 
pattern corresponding to the entry 

•ThreadID: ID of the thread containing the Chat 
pattern corresponding to the entry. This thread ID 
is entered in an entry in the thread index. 

25 
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Conf iquration and Operation of the contents estimation 
unit 3504 

Described below in detail are the operations of 
the contents estimation unit 3504 shown in FIG. 35. 

5 

As described above, the contents estimation unit 
3504 generates the display index 3505, that is, a user 
index, a document index, a thread index, a QA index, 
a DISCUSS index, and a CHAT index, based on the 
10 document contents, the information pertaining to 
documents, and the references among the documents of 
a group of documents stored in the secondary storage 
device 3503. 

FIG. 42 is a flowchart showing the operations 
15 performed by the contents estimation unit 3504. 

First, the document index having the data 
configuration shown in FIG. 37 and the thread index 
having the data configuration shown in FIG. 38 are 
generated (step 4201). The detailed description is 
20 omitted here, but the operations are realized by those 
shown in the flowchart shown in FIGs. 8 and 9. In 
this case, the above described meta-index corresponds 
to the document index. Simultaneously, a user index 
having the data configuration shown in FIG. 36 which 
25 relates the user name appearing in each document to 
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the user ID, the abbreviation, and the number of 

messages (number of answers) is generated. 

Next, a thread document group corresponding to 

each entry is read by referring to each entry in the 
5 thread index (step 4202), and the Q and A pattern 

determining process ( step 4203 ) , the Discussion 
~; pattern determining process (step 4204), and the Chat 

O pattern determining process ( step 4205 ) are performed 

rg; for each of the read thread document groups until it 

J?! 10 is determined (step 4206) that the process of the 

^ thread document groups for each entry has been 

S completed. 

fj| FIG. 43 is a flowchart showing the Q and A 

4: pattern in step 4203 shown in FIG. 42. According to 

W 15 the flowchart, a topic pattern is assumed for each 

reference pattern in the thread document group. 

First, one document ID corresponding to a leaf 
document (document at the end of a path) is selected 
by referring to the 'thread tree structure' field of 
20 the corresponding entry in the thread index ( step 
4301) . 

Next, in the document index, the document ID 
satisfying the following condition is retrieved from 
the 'path to root' field in the entry whose document 
25 ID selected in step 4301 is contained in the 'document 
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ID' field (step 4302). 

(condition) In the document index, the 'message 
type 1 field in the entry whose document ID is 
contained in the 'document ID 1 field has not been 
entered yet. 

Then, it is determined whether or not the 
document ID satisfying the above described condition 
has been detected (step 4303). 

If the document ID satisfying the above described 
condition (hereinafter referred to as a process 
document ID) is detected and the determination in step 
4303 is YES, then the document corresponding to the 
process document ID is read from the secondary storage 
device 3503 (FIG. 35), and it is determined whether 
or not a sentence pattern as shown in FIG. 44 exists 
in the document (step 4304). 

If the determination in step 4304 is NO, then 
control jumps to step 4308. 

If the determination in step 4304 is YES, it is 
determined whether or not the document ID satisfying 
the following condition exists in the document IDs 
entered in the 'path to root' field in the entry, in 
the document index, referred to in step 4302 (step 
4305). 

(condition) The document corresponding to the 
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document ID is generated by an author of a process 
document ID, and the document ID is entered in the 
'Thanks 1 field in an entry in the QA index forming 
part of the data shown in FIG. 39. 

If the determination in step 4305 is YES, a 
process document ID is added to the 'Question' field 
in the entry in the QA index referred to in step 4305. 
In the document index forming part of the data 
configuration shown in FIG. 37, a symbol 'Q' is added 
to the 'message type' field in the entry containing 
the process document ID in the 'document ID' field 
(step 4306) . 

Furthermore, in the document ID groups entered 
in the 'path to root' field in the entry referred to 
in step 4302 in the document index, the document ID 
group between the process document ID entered in the 
'Question' field and the document ID entered in the 
'Thanks' field in the entry in the QA index referred 
to in step 4305 are added to the 'Answer' field in the 
entry. Similarly, in the document index having the 
data configuration shown in FIG. 37, the symbol 'A' 
is added to the 'message type' field in each entry 
whose 'document ID 1 field contains each document ID 
entered as described above (step 4306). 

On the other hand, if the determination in step 
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4306 is NO, then, in the QA index, an entry having the 
value of {(maximum value of QA ID in the index) + 1} 
as the field in the ' QA_ID 1 field is generated, and 
the process document ID is entered in the 'Question' 
5 field in the entry. In the document index having the 
data configuration shown in FIG. 37, the symbol 'Q ! 
is entered in the 'message type' field in the entry 
containing a process document ID in the 'document ID' 
field (step 4307). 

10 After the process in step 4306 or 4307, or if the 

determination in step 4304 is NO, it is determined 
whether or not a sentence pattern as shown in FIG. 45 
exists in the document corresponding to the process 
document ID read from the secondary storage device 

15 3503 (step 4308). 

If the determination in step 4308 is NO, then 
control is passed to step 4302. 

If the determination in step 4308 is YES, then 
it is determined whether or not a document ID 

20 satisfying the following condition exists in the 
document IDs entered in the 'path to root' field in 
the entry referred to in step 4302 in the document 
index ( step 4309 ) . 

(condition) The document corresponding to the 

25 document ID is generated by an author of a process 
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document ID, and the document ID is entered in the 
'Question 1 field in an entry in the QA index forming 
part of the data shown in FIG. 39. 

If the determination in step 4309 is YES, a 
process document ID is added to the 'Thanks' field in 
the entry in the QA index referred to in step 4305. 
In the document index forming part of the data 
configuration shown in FIG. 37, a symbol 'T» is added 
to the 'message type' field in the entry containing 
the process document ID in the 'document ID' field 
(step 4310) . 

Furthermore, in the document ID groups entered 
in the 'path to root' field in the entry referred to 
in step 4302 in the document index, the document ID 
group between the process document ID entered in the 
'Thanks' field and the document ID entered in the 
'Question' field in the entry in the QA index referred 
to in step 4309 are added to the 'Answer' field in the 
entry. Similarly, in the document index having the 
data configuration shown in FIG. 37, the symbol 'A' 
is added to the 'message type 1 field in each entry 
whose 'document ID' field contains each document ID 
entered as described above (step 4310). 

On the other hand, if the determination in step 
4309 is NO, then, in the QA index, an entry having the 
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value of {(maximum value of QA_ID in the index) + 1} 
as the field in the ' QA_ID 1 field is generated, and 
the process document ID is entered in the ' Thanks 1 
field in the entry. In the document index having the 
data configuration shown in FIG* 37, the symbol 1 T ? 
is entered in the 'message type 1 field in the entry 
containing a process document ID in the ' document ID f 
field (step 4311). 

After the process in step 4310 or 4311, control 
is returned to step 4302, and the next document ID is 
retrieved . 

If it is determined in step 4303 that a document 
ID satisfying the condition in step 4302 cannot be 
detected after repeating the processes in steps 4302 
through 4311, then it is determined whether or not a 
process on the document ID corresponding to all leaf 
documents has been performed by referring to the 
'thread tree structure' field in the entry being 
processed in the thread index (step 4312). 

If a process on the document ID corresponding to 
all leaf documents has not been performed and the 
determination in step 4312 is NO, then control is 
returned to step 4301, and the topic pattern 
estimation process corresponding to the next path is 
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repeated. 

If a process on "the document ID corresponding to 
all leaf documents has been performed and the 
determination in step 4312 is YES, then the 
5 determination process about the Q and A pattern in 
step 4203 shown in FIG. 42 is completed. 

FIGs. 46 and 47 show an example of a thread 
structure extracted in the determination process about 
the Q and A pattern and a corresponding example of a 
10 group of documents. 

An entry of a user index (refer to FIG. 36) 
corresponding to the author of the document 
corresponding to the document ID of the entry assigned 
the symbol 'A' in the 'message type' field in the 
15 document index is referred to. The contents of the 
'number of messages (answers)' field in the entry is 
updated. 

FIG. 48 is a flowchart showing the operations in 
the Discussion pattern determining process in step 
20 4204 shown in FIG. 42. According to the flowchart, 
a Discussion pattern can be assumed for each reference 
path in a thread document group . 

First, a document ID corresponding to the leaf 
document (document at the end of a path) is retrieved 
25 by referring to the 'thread tree structure' field of 



the corresponding entry in the thread index ( step 
4801). 

Then, it is determined whether or not the 
processes for the document ID corresponding to all 
leaf documents have been tried (step 4802 ) . 

If the processes for the document ID 
corresponding to all leaf documents have not been 
tried and the determination in step 4802 is NO, then 
the 'path to root 1 field in the entry whose 'document 
ID' field contains the document ID retrieved in step 
4801 is referred to in the document index. Then, it 
is determined whether or not the length (number of 
document IDs) from the document ID corresponding to 
the above described leaf document to the root document 
is equal to or larger than 6 (step 4803). 

If the length is not larger than 6 and the 
determination in step 4803 is NO, then the topic 
pattern of the reference path is not assumed as a 
Discussion pattern. Control is returned to step 4801 
to perform a process on the next leaf document. 

If the above described length is equal to or 
larger than 6 and the determination in step 4803 is 
YES, then the number of different user IDs is counted 
in a group of documents corresponding to the document 
ID contained in the 'path to root' field referred to 
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in step 4803 (step 4804). 

Then, it is determined whether or not the ratio 
of the number of the different user IDs to the number 
of document IDs ( total number of documents ) contained 
in the above described 'path to root 1 field is smaller 
than 0.3 (step 4805). 

If the determination is NO, then it is assumed 
that there is no discussions among the small specified 
number of users. Control is returned to step 4801, 
and the process is performed on the next leaf 
document . 

On the other hand, if the determination in step 
4805 is YES, then it is determined that there is a 
discussion among a small specified number of users. 
In the DISCUSS index having the data configuration 
shown in FIG. 40, an entry having the value {maximum 
value of DS_ID in the index) + 1} as a value of the 
f DS_ID f field is generated. Then, in the 'message ID' 
field in the entry, a document ID group in the 'path 
to root' field referred to in step 4803 are entered 
as a list. Based on the entered contents, the 
contents of the ! UID' field and the 'threadID' field 
are entered. Furthermore, in the document index 
having the data configuration shown in FIG. 37, the 
symbol 'D' is entered in the 'message type 1 field in 
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each entry whose 'document ID' field contains the 
above described document ID groups (step 4806). Then, 
control is returned to step 4801 and the process is 
performed on the next leaf document. 
5 If the processes have been performed on the 

document IDs corresponding to all leaf documents and 
O the determination in step 4802 is YES, then the 

0 Discussion pattern determining process in step 4204 

a?i shown in FIG. 42 terminates. 

;r! 10 FIG. 49 shows an example of the thread structure 

0" extracted in the above described Discussion pattern 

p determining process. In FIG. 49, the initial 

~| characters of only a small number of users appear, and 

4y it is guessed that a discussion is made in this 

W 15 thread. 

FIG. 50 is a flowchart showing the Chat pattern 
determining process in step 4205 shown in FIG. 42. 
According to this flowchart, a Chat pattern is assumed 
for each reference path in the thread document group. 
20 First, the document ID corresponding to the leaf 

document (document at the end of a path) can be 
retrieved by referring to the 'thread tree structure 1 
field of the corresponding entry in the thread index 
(step 5001). 

25 Next, as a result of the above described 
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retrieval, it is determined whether or not the process 
on the document ID corresponding to all leaf documents 
has been tried ( step 5002 ) . 

If the process on the document ID corresponding 
5 to all leaf documents has not been tried and the 
determination in step 5002 is NO, then the 'path to 
root 1 field in the entry having the 'document ID f 
field containing the document ID retrieved in step 
5001 in the document index is referred to. Then, it 

10 is determined whether or not the length (number of 
document IDs) from the document ID corresponding to 
the above described leaf document to the root document 
is equal to or larger than 6 ( step 5003 ) . 

If the length is not equal to or larger than 6 

15 and the determination in step 5003 is NO, then it is 
assumed that the topic pattern in the reference path 
is not a Chat pattern. Control is returned to step 
5001, and the process is performed on the next leaf 
document . 

20 If the above described length is equal to or 

larger than 6, and the determination in step 5003 is 
YES, then the number of different user IDs is counted 
in the group of documents corresponding to the group 
of document IDs contained in the 'path to root' field 

25 referred to in step 5003 (step 5004). 
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Then, it is determined whether or not the ratio 
of the number of the above described different user 
IDs to the number of document IDs (total number of 
documents) contained in the above described 'path to 
root' field is larger than 0,6 (step 5005). 

If the determination is NO, then it is assumed 
that a chat is made by a number of users. Control is 
returned to step 5001 and the process is performed on 
the next leaf document. 

On the other hand, if the determination in step 
5005 is YES, then it is assumed that a chat is made 
among a large number of users, and, in the CHAT index 
having the data configuration shown in FIG. 41, an 
entry having the value {maximum value of CT__ID in the 
index) + 1} as a value of the 'CT_ID' field is 
generated. Then, in the 'Chat list' field in the 
entry, a document ID group in the 'path to root' field 
referred to in step 5003 are entered as a list. Based 
on the entered contents, the contents of the T UID f 
field and the 'thread ID' field are entered (step 
4806). Then, back in step 5001, the process is 
performed on the next leaf document. 

If the process has been tried on the document ID 
corresponding to all leaf documents, and the 
determination in step 5002 is YES, then the chat 




80 

pattern determining process in step 4205 shown in FIG. 
42 terminates. 

FIG. 51 shows an example of a thread structure 
extracted in the above described Chat pattern 
5 determining process. It is assumed that various 
initial characters of a large number of users appear, 

JESS. 

^ and a chat is being made in this thread. 

ass. 

_r : 

f5j Principle of the curtailing process of a thread tree 

Ir: 10 Described below is the principle of the thread 

0^ tree curtailing process according to the second 

£i embodiment of the present invention. 

Sf! FIGs. 52 and 53 show examples of displaying a 

^ thread structure. FIG. 52 shows an example of a 

Ei 15 display in a conventional news reader. FIG. 53 shows 

an example of a display in the above described thread 
view shown in FIG. 14. 

In a thread containing only 17 messages, it is 
necessary to scroll the screen to see the entire 
20 thread because of a large number of lines as shown in 
FIG. 52, an overflow in the horizontal direction as 
shown in FIG. 53, etc. Therefore, it is hard to view 
the entire structure. 

FIG. 54 shows an example of a display for n 
25 higher order child nodes (n = 6 in the example shown 
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in FIG. 54) in the thread. The symbol ? + f at the 
beginning of the line indicates that a child node is 
omitted, and the number enclosed by parentheses at the 
end of the line indicates the number of child nodes 
5 (number of referenced child in the document index 
shown in FIG. 37). It is possible to display only a 

C necessary portion by adjusting the value n based on 

q the screen and window size. 

JP; 

FIG. 55 shows an example of a display in which 
!r; 10 nodes having the same title (excluding the leading 

OH f Re: ' ) are omitted in the child nodes. The leading 

P symbol ' +' indicates that the child node is omitted. 

J! At default, the title of the document in the same 

Op thread is the same as the title of the parent node, 

yTj 15 or the leading 'Re: 1 indicating the 'following 1 is 

added. When an author intentionally changes a title, 
it explicitly indicates a change of topic at the 
point . Using this view, a change of topic in the 
thread can be easily recognized. 
20 FIG. 56 shows an example of displaying a thread 

of documents generated in the period from 10/1 to 
10/5. A node indicates the initial character of the 
name of an author. In this view, only the transition 
of a thread with time and the information within a 
25 predetermined time can be displayed. Furthermore, 
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document information can be displayed as related to 
the user ! s schedule and general news in the world by 
arranging on the screen the information with the 
application containing time information such as a 
scheduler, etc. 

FIG. 57 shows the graph structure using an author 
of a thread as a node. A node indicates an initial 
character of the name of an author. A node marked 
with double circles indicates an author of the first 
message of a thread. The density of a link indicates 
the number of messages among the authors. 
Furthermore, the node of a user corresponding to the 
user ID entered in the ID of a user issuing the 
largest number of messages in the corresponding entry 
in the thread index shown in FIG. 38 can be, for 
example, displayed at a high intensity level. It is 
easily understood that this thread is occupied by the 
dialogue between Mr. Ogawa (0) and Partziwal (P). 

Retrieval phase executing operation 

Described below are the operations of performing 
the retrieval phase including the above described 
thread tree curtailing process. 

At the retrieval, the user instructs a retrieval 
request from the input device 3509. 
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FIG. 58 is a retrieval request input screen. 
Input items are listed below. 

•Keyword string (required) contained in a message to 
be retrieved 

5 -Keyword string not to be included in the message to 

be retrieved 

•All messages, only the message corresponding to the 
Q and A pattern as the type of message to be 
retrieved. All messages at default. 
10 -All period, within one month, within on week, etc. 

as date of message to be retrieved. All period at 
default. 

At the lower portion of the input screen shown 
in FIG. 58, the latest date containing (as downloaded) 
15 the message in the secondary storage device 3503 in 
the preparation phase prior to retrieval . 

Listed below are retrieval results. 
•List of threads (refer to FIG. 60) 

•Display of thread structure (including the display 
20 with curtailment depending on the number or 
references, including the display with curtailment 
based on the same title) (refer to FIG. 61) 
•Display of time period thread (refer to FIG. 62) 
•Display of Q and A in combination 
25 -Display of author node graph (refer to FIG. 57) 
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•List: of opinions input by authors 
•display of messages 

These display screens can be switched from and 
to each other* Listed below are the typical display 
5 screens . 

Output result 1: List of threads 

For example, when 'engine 1 is input as a 
retrieval keyword, a thread list screen is displayed 
as shown in FIG, 60. The retrieval results displayed 
in FIG. 60 are listed below. 
•Title of the top message of a thread 
•Name of an author 
•Date 

•Size (number of messages of a thread, entire message 
size) 

•Contents of a thread (QA: Q and A pattern, DC; 
Discussion pattern, CT; Chat pattern) 

At retrieval, a sorting process is performed 
based on the size of a thread, and the top ten threads 
are displayed. When the user clicks the button with 
the 'next ten threads' pointed to, the next ten 
threads are displayed. 

When there are a number of results to be 
displayed, a selective retrieval process can be 
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realized by further adding keywords. 

A screen can be switched to another by the 
following methods. 

•A thread structure is displayed by clicking the 
5 button with the title pointed to. 

•An author node graph is displayed by clicking the 
button with the name of the author pointed to. 
•A time period thread is displayed by clicking the 
button with the date pointed to. 
10 *A pair of Q and A are displayed by clicking the 
button with the Q and A related to the contents of the 
thread pointed to. 

Output result 2: Display of thread structure 
15 FIG. 61 shows an example of displaying a thread 

structure. 

As shown in FIG. 54, a thread structure is 
displayed as a curtailed tree structure based on the 
number of reference nodes. Based on the number of 

20 lines (length in the vertical direction) in a display 
area, a node having a smaller number of reference 
nodes is omitted. Listed below are the contents in 
displaying nodes. The items to be displayed in each 
node are appropriately omitted depending on the number 

25 of digits (length in the horizontal direction) in the 
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display area, 

•The symbol ' +' at the beginning of a line is added 
when there is an omitted child node. 

•A message containing a user-input keyword is 
5 displayed at a high intensity level for the title and 
the name of the author (portion encompassed by a 
rectangle in FIG. 61). 

•Title of a message. 'Re: f is added to a following 

message. 
10 -Author of message 

•Type of message. Q (question), A (answer), or D 

(discussion) is added depending on the topic pattern 

assumed by the contents estimation unit 3504 (FIG. 35). 

•Number of child nodes of the present node. Only 
15 added to the node having the symbol ' +'. 

In a thread, a selective retrieval process can 

be performed with a further keyword specified. 

A screen can be switched to another by the 

following methods. 
20 -The body of a message is displayed by clicking the 

button with the title pointed to. 

•An author node graph is displayed by clicking the 
button with the name of the author pointed to. 
•A thread structure is displayed as a tree structure 
25 curtailed based on the same title by clicking the 
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button with a list of titles pointed to. 

Output result 3; Display of time period thread 

FIG, 62A shows an example of displaying a thread 
in a predetermined time period. A message containing 
a keyword is displayed with a black circle. 
Otherwise, a message is displayed with a gray circle. 
The initial character of the name of an author is 
added below each node. On this screen, the display 
period for date, the vertical and horizontal display 
of date, the window size, the cell width, etc. are 
variable. Therefore, for example, a scheduler can 
match another scheduler in size. For example, FIG. 
62A shows a scheduler and FIG, 62B shows another, and 
they match in cell width for time period thread 
display. 

A screen can be switched to another by the 
following methods. 

•The body of a message is displayed by clicking the 
button with the title pointed to. 

Output result 4: QA combination display 

The QA combination display refers to a 
combination of a question and an answer which 
corresponds to the Q and A pattern assumed by the 
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contents estimation unit 3504 (FIG- 35) and is 
displayed as a table. A line of the table displays 
the following information. 

•Title 

•Questioner 

•Answerer 

A screen can be switched to another by the 
following methods. 

•A thread structure is displayed by clicking the 
button with the title pointed to. 

•An author node graph is displayed by clicking the 
button with the name of the author pointed to. 

Output result 5: Author node graph 

An author node graph refers to a graph relating 
to the dialogue of authors of each message in the 
thread. FIG. 57 shows an example of a display. 

A screen can be switched to another by the 
following methods. 

•A list of an author's input opinions is displayed by 
clicking the button with the name of an author pointed 
to. 

•A thread structure is displayed by clicking the 
button with the link pointed to. 
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Output result: 6: List: of opinions of an author 

A list of opinions of an author refers to a 
screen through which a list of messages input by each 
author can be displayed. The date, the title, and the 
type of message (Q, A, D) are displayed in order of 
date. 

A screen can be switched to another by the 
following methods. 

•The body of a message is displayed by clicking the 
button with the title pointed to. 

Output result 7: Body of message 

It refers to the body of message. The link to 
the name of an author, the title, the date, and the 
parent message is also displayed. 

A screen can be switched to another by the 
following methods. 

* A thread structure is displayed by clicking the 
button with the title pointed to. 

•A time period thread is displayed by clicking the 
button with the date pointed to. 

•An author node graph is displayed by clicking the 
button with the name of the author pointed to. 

Operation of time period thread display 
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FIG* 63 is a flowchart showing the operations of 
the time period thread display performed by the view 
generation unit 3508. 

First, in the thread index having the data 
configuration shown in FIG. 38, one document ID 
contained in a thread is selected by referring to the 
'thread tree structure' field in the entry 
corresponding to the thread to be displayed (step 
6301). 

Next, in the document index having the data 
configuration shown in FIG. 37, the 'date' field in 
the entry corresponding to the above described 
document ID selected is referred to, and the data is 
entered in the calendar index having the data 
configuration shown in FIG. 64 (step 6302). 

Then, it is determined whether or not the process 
has been performed on all document IDs in the ' thread 
tree structure 1 field in the entry corresponding to 
the thread to be displayed in the thread index ( step 
6303). 

If the process has not been tried on all document 
IDs, and the determination in step 6303 is NO, then 
control is returned to step 6301 and the process on 
the next document ID is repeatedly performed. 

If the process has been tried on all document 



IDs, and the determination in step 6303 is YES, then 
document nodes are mapped on the calendar by referring 
to the calendar index having the data configuration 
shown in FIG. 64. The edge of reference is displayed 
by referring to the thread index. 

Author node graph display operations 

FIG. 65 is a flowchart showing the operations of 
the author node graph display performed by the view 
generation unit 3508 (FIG. 38). 

First, in the thread index having the data 
configuration shown in FIG. 38, one document ID 
contained in a thread is selected by referring to the 
'thread tree structure 1 field in the entry 
corresponding to the thread to be displayed. Next, 
by referring to the document index having the data 
configuration shown in FIG. 37 and the user index 
having the data configuration shown in FIG. 36, a user 
ID of the parent document (parent message) of the 
document corresponding to the selected document ID is 
obtained ( step 6501 ) . 

Next , it is determined whether or not an entry 
corresponding to the above described set exists in the 
author array having the data configuration shown in 
FIG. 66 (step 6502). 



If the entry exists, control is passed to step 

6504. 

If the entry does not exist, an entry of the 
horizontal or vertical axis of the author array is 
added (step 6503). 

Then, the number for the above described entry 
is incremented by 1 (step 6504). 

Then, it is determined whether or not the process 
has been performed on all document IDs in the 1 thread 
tree structure' field in the entry corresponding to 
the thread to be displayed in the thread index (step 
6505). 

If the process has not been tried on all document 
IDs, and the determination in step 6505 is NO, then 
control is returned to step 6501 and the process on 
the next document ID is repeatedly performed. 

If the process has been tried on all document 
IDs, and the determination in step 6505 is YES, nodes 
are drawn, as shown in FIG. 67, for the number of 
entries of the author array having the data 
configuration shown in FIG. 66, and the symbol is 
drawn between the parent and the child. The thickness 
of the line depends on the frequency of the dialogue 
between the parent and the child (step 6506). 

In the above described second embodiment, a 
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thread can be displayed depending on the range of the 
screen display by curtailing the thread structure 
curtailment . 

Since a retrieval result is automatically 
displayed with an assumed topic, the user can easily 
understand the summary of the retrieval result even 
when there are a large number of threads as a 
retrieval result. 

Furthermore, although there is a large volume of 
documents in a thread, the same author may have input 
his or her opinions many times. By providing views 
based on authors, a key person in the thread can be 
easily detected and the entire configuration of the 
thread can be compactly displayed. 

Supplement to storage medium storing a program for 
realizing a preferred embodiment of the present 
invention 

The present invention can also be designed as a 
computer-readable storage medium used to direct a 
computer to perform the functions realized by each of 
the configurations according to the preferred 
embodiments of the present invention. 

In this case, for example, a portable storage 
medium 6802 such as a floppy disk, a CD-ROM disk, an 
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optical disk, a removable hard disk, etc., and a 
program for realizing each step according to the 
preferred embodiment of the present invention through 
a network line 6803 are loaded onto memory (RAM, a 
5 hard disk, etc. ) 6805 in a body 6804 of a computer 
6801 for execution as shown in FIG. 68. 
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